A likelihood framework to analyse phyletic patterns.

Authors

  • Ofir Cohen
  • Nimrod D Rubinstein
  • Adi Stern
  • Uri Gophna
  • Tal Pupko
Abstract

Probabilistic evolutionary models have revolutionized our capability to extract biological insights from sequence data. While these models accurately describe the stochastic processes of site-specific substitutions, single-base substitutions represent only a fraction of all the events that shape genomes. Specifically, in microbes, events in which entire genes are gained (e.g. via horizontal gene transfer) and lost play a pivotal evolutionary role. In this research, we present a novel likelihood-based evolutionary model for gene gains and losses, and use it to analyse genome-wide patterns of the presence and absence of gene families. The model assumes a Markovian stochastic process, where gains and losses are represented by transitions from absence to presence and from presence to absence, respectively, given an underlying phylogenetic tree. To account for differences in the rates of gain and loss of different gene families, we assume among-gene family rate variability, thus allowing for a more accurate description of the data. Using a Bayesian approach, we estimated an evolutionary rate for each gene family. Simulation studies demonstrated that our methodology accurately infers these rates. Our methodology was applied to analyse a large corpus of data, consisting of 4873 gene families spanning 63 species, and revealed novel insights regarding the evolutionary nature of genome-wide gain and loss dynamics.
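
The kind of model the abstract describes can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a two-state continuous-time Markov chain (0 = absent, 1 = present) with a gain rate and a loss rate, a root drawn from the stationary distribution, and a small set of equally weighted rate categories standing in for among-gene-family rate variability. All function names, the toy tree, and the rate values are hypothetical.

```python
import math

def transition_matrix(gain, loss, t):
    """P[i][j]: probability of going from state i to j along a branch of length t."""
    total = gain + loss
    pi1 = gain / total            # stationary probability of presence
    pi0 = loss / total            # stationary probability of absence
    decay = math.exp(-total * t)
    return [[pi0 + pi1 * decay, pi1 * (1.0 - decay)],
            [pi0 * (1.0 - decay), pi1 + pi0 * decay]]

def partial_likelihoods(node, pattern, gain, loss):
    """Felsenstein pruning: [L(absent), L(present)] for the subtree rooted at node."""
    if isinstance(node, str):     # leaf: the observed state has likelihood 1
        return [1.0, 0.0] if pattern[node] == 0 else [0.0, 1.0]
    result = [1.0, 1.0]
    for child, branch_length in node:   # internal node: list of (child, length)
        p = transition_matrix(gain, loss, branch_length)
        child_l = partial_likelihoods(child, pattern, gain, loss)
        for parent_state in (0, 1):
            result[parent_state] *= sum(
                p[parent_state][s] * child_l[s] for s in (0, 1))
    return result

def pattern_likelihood(tree, pattern, gain, loss, rate=1.0):
    """Likelihood of one phyletic pattern; rate scales both gain and loss."""
    g, l = gain * rate, loss * rate
    root = partial_likelihoods(tree, pattern, g, l)
    pi1 = g / (g + l)             # assumption: root state is at stationarity
    return (1.0 - pi1) * root[0] + pi1 * root[1]

def posterior_mean_rate(tree, pattern, gain, loss, rates):
    """Posterior mean rate for one gene family, equal prior weight per category."""
    likes = [pattern_likelihood(tree, pattern, gain, loss, r) for r in rates]
    return sum(r * lk for r, lk in zip(rates, likes)) / sum(likes)

# Toy tree ((A:0.1, B:0.1):0.2, C:0.3) and one presence/absence pattern.
tree = [([("A", 0.1), ("B", 0.1)], 0.2), ("C", 0.3)]
pattern = {"A": 1, "B": 1, "C": 0}
rates = [0.25, 1.0, 4.0]          # hypothetical rate categories

print(pattern_likelihood(tree, pattern, gain=0.5, loss=1.5))
print(posterior_mean_rate(tree, pattern, gain=0.5, loss=1.5, rates=rates))
```

In this sketch, a gene family present in every species tends to receive a lower posterior mean rate than one with a mixed pattern, which is the intuition behind estimating a per-family evolutionary rate.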

Similar resources

Estimation of Gene Insertion/Deletion Rates with Missing Data.

Lateral gene transfer is an important mechanism for evolution among bacteria. Here, genome-wide gene insertion and deletion rates are modeled in a maximum-likelihood framework with the additional flexibility of modeling potential missing data. The performance of the models is illustrated using simulations and a data set on gene family phyletic patterns from Gardnerella vaginalis that includes a...

Technology Spillovers of FDI in ASEAN Sourcing from Local and Abroad

The effect of technology spillovers is widely considered as one of the main channels through which domestic firms benefit from FDI, and plays an important role in economic development of host countries. Based on the analysis framework for technology spillovers established by Borensztein et al. (1998), this paper will analyse and try to figure out the development patterns of ASEAN by utilizing ...

A Bayesian Nominal Regression Model with Random Effects for Analysing Tehran Labor Force Survey Data

Large survey data are often accompanied by sampling weights that reflect the unequal probabilities of selecting samples in complex sampling. Sampling weights act as an expansion factor that, by scaling the subjects, makes the sample representative of the community. The quasi-maximum likelihood method is one of the approaches for considering sampling weights in the frequentist framewo...

A Framework for Exploring the Frequent Patterns based on Activities Sequence

In recent years, the development of the use of location-based tools has made it possible to produce geometric trajectories from the user's movement paths. In this way, users' goal of traveling and related activities can be considered in addition to the geometry and route shape. The user activity trajectory represents the sequence of the visited activities and its related analysis as presented i...

Body mass and temperature influence rates of mitochondrial DNA evolution in North American cyprinid fish.

The mass-specific metabolic rate hypothesis of Gillooly and others predicts that DNA mutation and substitution rates are a function of body mass and temperature. We tested this hypothesis with sequence divergences estimated from mtDNA cytochrome b sequences of 54 taxa of cyprinid fish. Branch lengths estimated from a likelihood tree were compared with metabolic rates calculated from body mass a...

Journal:
  • Philosophical transactions of the Royal Society of London. Series B, Biological sciences

Volume 363, Issue 1512

Publication date: 2008